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Abstract 

Sparse representation of images under certain transform domain has been playing a fun¬ 
damental role in image restoration tasks. One such representative method is the widely used 
wavelet tight frame. Instead of adopting fixed filters to sparsely model any input image, a data- 
driven tight frame was proposed and shown to be very efficient for image denoising very recently. 
However, in this method the number of framelet filters used for constructing a tight frame is the 
same as the length of filters. In fact, through further investigation it is found that part of these 
filters are unnecessary and even harmful to the recovery effect due to the influence of noise. 
Therefore, an improved data-driven sparse representation systems constructed with much less 
number of filters are proposed. Numerical results on denoising experiments demonstrate that the 
proposed algorithm overall outperforms the original tight frame construction scheme on both the 
recovery quality and computational time. 

Key words: data-driven; image denoising; sparse representation; signal subspace; noise 
subspace. 


1. Introduction 

Image denoising, which aims to estimate the clean image from its noise-corrupted observa¬ 
tion, is a fundamental topic in the field of computer vision and image processing. Two types 
of commonly encountered noise are additive white Gaussian noise (AWGN) and multiplicative 
noise (MN). AWGN is often introduced in the optical imaging due to the thermal motion of 
electron in camera sensors or circuits, and MN (also called speckle) often appears in coherent 
imaging systems which are different from the usual optical imaging, e.g., synthetic aperture radar 
(S AR) or ultrasound imaging. By taking the logarithmic transform of the data, the MN or speckle 
noise model can be converted into a tractable additive noise lilli . Then many denoising tech¬ 
niques which are widely used for AWGN can also be applied for multiplicative noise removal 
In this article, we focus our attention on AWGN denoising problem. 

In the last several years, the sparsity-inducing prior based on transformed frames has been 
playing a very important role in the development of effective image denoising methods. The 
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fundamental idea behind is that the interest image or image patches can be sparsely represented 
W properly designed transformed frames. One such popular method is the wavelet tight frames 
iSl^, which use fixed functions such as the linear or cubic B-spline as the generators and can 
be obtained by the unitary extension principle (UEP). Wavelet tight frames are widely used for 
image restoration tasks due to its strong ability of sparsely approximating piecewise smooth re¬ 
gions and preserving the jump discontinuities, and the low computational cost QiSEIil- 
However, the structure of images varies greatly in practice, and a tight frame working well for 
one type of images may be not suitable for another. In order to properly represent the textures 



imization algorithm to learn an over-complete dictionary from the noisy image patches, and use 
the learned dictionary to sparsely model and denoise images. These dictionary learning meth¬ 
ods improve the quality of denoised images. However, the corresponding computational cost is 
too high due to the fact that the sparse coding of a large number of image patches is required 
during each iteration. Recently, the patch-dictionary learning methods were combined with an¬ 
other widely used nonlocal techniques E 1^ . and various novel image restoration models 
were generated. These nonlocal sparsity representation methods were 
among the current state-of-the-art approaches in the quality of recovered images. However, the 
corresponding computational cost is really too high due to the computation of patch-similarity 
weights, patches group and sparse coding for a large number of image patches. In fact, only the 
computation of similarity weights will spend much time. 

Very recently, an adaptive tight frame construction scheme was proposed to overcome the 
drawback of the traditional wavelet tight frames ||2^ . Differently from the existing tight framelet 
methods, the main idea of the new approach is to learn a tight frame from the input image. 
Through minimizing a problem with respect to the framelet filters and the canonical frame co¬ 
efficient vectors, a data-driven tight frame can be efficiently learned by several iteration steps 
of simple singular value decomposition (SVD) and hard thresholding operator. Numerical ex¬ 
periments demonstrate that, with comparable performance in image denoising, the proposed 
data-driven tight frame construction scheme runs much faster than the general dictionary learn- 
ing methods such as the classical K-SVD method. The proposed variational model in 12511 is a 
non-convex model and the convergence of the corresponding alternating minimization algorithm 
is further investigated in the recent literature 12^ . which proves the sub-sequence convergence 
property of the algorithm. The data-adaptive tight framelet has also been directly allied to 
remove the multiplicative noise by simply using the logarithmic transform of the data yj]. 

In the above-mentioned data-driven tight frame construction algorithm, the framelet filters 
are updated by the simple SVD of a matrix with the size of x at each iteration, where p x p 
is the size of the chosen filters. Therefore, the number of framelet filters learned from the input 
image is the same as the length of the support set of filters, which is also equal to p^. However, 
in the implementation process of the proposed algorithm we observe that a majority of these 
learned framelet filters are unnecessary and even harmful to the denoised results due to the fact 
that the noise obviously influences the learned Alters that correspond to the features that seldom 
appear in the input image. Based on this observed phenomena, we divide the matrix used for 
generating the framelet Alters into two part, which correspond to the signal subspace and noise 
subspace respectively. Then we further propose an improved data-driven sparse representation 
method with much less number of Alters learned from the bases of the signal subspace. Due to 
the removal of Alters which are useless for the sparse representation of the image features, the 
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computational cost is further reduced and the quality of denoising results is not damaged at the 
same time. The proposed data-driven framelet construction scheme is compared with the original 
algorithm with application to AWGN denoising problem. 

The rest of this paper is organized as follows. In section we briefly review the classi¬ 
cal K-SVD algorithm and the recently proposed data-driven tight frame construction algorithm. 
In section [3l in order to overcome the drawback of the previous work, we propose a new fil¬ 
ters learning method, which excludes the inessential framelet filters. The convergence of the 
proposed algorithm is further investigated. In section |4] the numerical examples on AWGN de¬ 
noising are reported to compare the proposed algorithm with the original data-driven tight frame 
construction scheme. 


2. Data-driven tight frame construction scheme 


In this section, we briefly introduce some preliminaries of wavelet tight frames, and then 
review and compare the classical K-SVD algorithm and the recently proposed data-driven tight 
frame construction scheme. 

Starting with a finite set of generators 'T = {'Pi, ■ • • , 'P„,), one wavelet frame for TaCR) is just 
the affine system generated by the shifts and dilations of 'P, i.e., it is defined by 

X('P) = l<q<m,jeZ,ke z} 

where ^qj,k - 2''^^'P^(2'' ■ -k). Such system 2f('P) is called a tight frame of L2(R) if it satisfies 
/ = for ^riy / e T 2 (R)- A tight frame has two associated operators: one is the 

analysis operator W defined by 

W: MeL2(R)^ 


and the other is the adjoint operator (also called the synthesis operator) defined by 




* ^ 'y j qj,k ^ T2(M). 


The construction of wavelet tight frame is using the multi-resolution analysis (MRA). In the dis¬ 
crete setting, it can be generated by the shifts and dilations of a set of masks or filters {hi, - ■ ■ , h„,} 
associated with the corresponding generators 'P. For more details refer to ||3,S]- The framelets 
for LaiM?-) can be easily constructed by taking tensor products of 1-dimensional framelets. 

However, in the tight frame systems introduced above, the filters are fixed and unable to 
be adaptively adjusted according to the input image. Therefore, the approaches of learning an 
over-complete dictionary from the image were widely developed in recent years. One of the 
representative work is the K-SVD method 11411 . Here we give a brief review on this method. 

Let / and g denote the noise-free image and the noisy image respectively, and | ^ 

be the set of image patches of size p x p densely extracted from the image /. Let D — 
{di,d 2 , ■ ■ ■ ,d„^ e (Jejjote the dictionary whose column vectors represent the dictionary 

atoms. Then the K-SVD method for image denoising is expressed by the following minimization 
problem: 


min 

/,v.6 


5 "^' 


f\\l + J]'^iMh+pY,^\Dvr 


Ml 


( 2 . 1 ) 
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where v is the the sparse coding coefficient composed by v,, and v, denotes the expansion coeffi¬ 
cient vector of the i-th image patch over the dictionary D. 

The performance of the K-S VD method outperform the wavelet tight frames in image denois- 
ing due to the fact that the repeating textures or features of images are likely to be captured by 
the learned dictionary atoms. However, the corresponding minimization problem (12.11) is difficult 
to be solved efficiently. An alternating iterative algorithm is adopted to update the values of /, v 
and D respectively in literature Q. Specifically, the sparse coding coefficient is calculated in 
each iteration by the orthogonal matching pursuit (OMP), which is quite slow and accounts for 
most of the computational amount of this method. 

Very recently, Cai et.al ll2^ further proposed a variational model to learn a tight frame sys¬ 
tem from the image itself. The tight frames considered there are single-level un-decimal discrete 


wavelet systems generated by the data-driven filters {hi, - ■■ , For each hi, let hi be its col¬ 


umn vector formed by concatenating all its columns. Define H - 


h\,h2. 



, and W{H) and 


W^(H) be the analysis operator and the synthesis operator generated by the filters //respectively. 
The problem of learning the filters H from the image is still difficult for the arbitrary m. There¬ 
fore, the authors in ll2^ consider a special case of m - and each /i, be a real-valued filter with 
support on (~i [1, p]^. In this situation, it has been proved that the constraint W^(H)W{H) - I 
exists as long as H - -^1. The interested reader can refer to Proposition 3 of ll25ll for details. 

Based on this conclusion, a novel variational model for data-driven tight frame construction 
is proposed in which can be briefly summarized as follows: 


min ||v-IT(//)g|| 2 -H/iol|v||o, s.t. //^//= ^/ (2.2) 

where v is the coefficient vector that sparsely approximates the tight frame coefficient W(H)g. 
Compared with the minimization problem (12.11 ). it is observed that the coefficient v in problem 
(12.2b can be simply obtained by the hard thresholding operator, and hence the new method avoids 
the huge computational burden of updating the coefficient v in the K-SVD method. Intuitively, 
the K-SVD method (12.1b can be seen as a synthesis-based model for sparse representation; and 
the new variational model (12.2b is just an analysis-based model which can be solved more easily 
and thus the computational efficiency can be improved greatly. 

The problem (12.2b can be rewritten in the matrix form. For each patch g,-, let v, denote the 
vector corresponding to H^'gi 6 . Define 


^ ~ Jj 

Then problem (12.2b is equal to 


min||y-D^G||i H-d^||y||o, s.t. D‘D = I. 

V.D 


(2.3) 


(2.4) 


where A represents some fixed regularization parameter. In literature 02511 . this problem is solved 
via an alternating minimization scheme between the coefficients V and the learning filters D. 
More specifically, given the current iteration value of the variable D, it is easy to deduce 
that the optimal value V of the minimization problem (12.4b can be simply obtained by the hard 
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thresholding operator. In what follows, fixed the value of V as Vk+\, the next iteration update for 
D is given by the minimization problem 


minliy^+i-D^GIli, s.t. D = I. 

D 


(2.5) 


Let UAXl be the SVD of GVi 


w /^j 6 M-P such that = IJuX-kX^. It has been shown in 

02-511 that the minimization problem (12.5b has a closed solution denoted by 


= UkXl. ( 2 . 6 ) 

It is observed that the filters set D associated with the framelet can be obtained by a simple SVD 
of a matrix of small size x p^. 


3. Improved data-driven sparse representation method 

3.1. Motivation and model description 

In the above-mentioned algorithm, the matrix D associated with the framelet filters is updated 
by the iterative formula (12.6b . where Uk and Xk are given by the SVD of Assume that 

Si = diag{ri}, where denotes the singular value of in a descending order. Let 

Uk — Mp • • • > “i I and Xk - |.r^, • • • , |. Then GVj^j can be reformulated as 

= X riu[{x[Y = ^ ^ riu[{x[Y . (3.7) 

1=1 i-l i=s+l 

signal subspace noise subspace 

In the denoising problem, the input image g used for generating the matrix G is contaminated 
by noise, thus so is the matrix GVj^j. Therefore, we can divide the whole space spanned by the 

bases {Mi(-^i)^}^_j into two orthogonal subspace; the signal subspace which is spanned by the 
bases corresponding to the s{s < p^) largest singular values; and the noise subspace which is 
spanned by the bases corresponding to the rest singular values. Then in the formula (13.7b the 
matrix GVj^j is divided into two part-one is the signal component obtained by projection on the 
signal subspace, the other is the noise component obtained by projection on the noise subspace. 

S . . P~ . . P' . . 

Rewrite regarded as 

1=1 1=5+1 1=5+1 

the filters component mainly learned from the noise component of the input image, and hence 
maybe useless and even harmful to the final denoised results. Therefore, it should be removed in 
the filters learning process. 

To illustrate this conclusion more clearly, we use the Barbara image of size 256 x256 contam¬ 
inated by zero-mean AWGN of cr = 25 as an example, where cr denotes the standard deviation 
of the noise. Figure [T(c)] shows the curve of the singular values of GV^{K = 25). It is observed 
that the singular values r, decrease very fast with the increase of the index i. This phenomenon 
implies that the framelet filters corresponding to the last several singular values of GVj^j will be 
influenced by the noise more obviously compared with those corresponding to the larger singular 
values. In fact, the observation in Figure [T(c)| reveals the low-rank properties of the features of 
images. It is well known that the low-rank property has been widely utilized for developing image 
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Figure 1: (a) The original Barbara image, (b) the noisy image, (c) the final singular values of after 25 iteration 

steps, (d) corresponding adaptive tight frame filters, (e) frame filters generated by the bases of signal subspace, (f) frame 
filters generated by the bases of noise subspace. 


restoration models Numerical examples in section 0] also verify the low-rank prop¬ 

erties. Figure [T(d)| presents the obtained adaptive tight frame filters generated by 
is found that many filters included in the figure are likely to be contaminated by the noise, e.g., 
the second to seventh filters in the first row, and the fifth to seventh filters in the second row of 
Figure [T(d)| This is due to the fact that these filters are generated by the bases corresponding to 
the last several singular values of and hence influenced by the noise obviously. Figures 

|l(e)| and |l(f)| also show the frame filters generated by where we 

choose i = 30 (The selection of s is influenced by the amount and pattern of repeating textures or 
features and the noisy level. Through many trials for different values of s, we found that i = 30 
is a suitable value for separating the signal subspace and noise subspace of the Barbara image). It 
is clearly observed that the filters contained in the noisy subspace have been severely influenced 
by the noise, and hence are not suitable for the sparse representation of the image. 

Motivated by the above analysis, we propose a new strategy to remove the useless framelet 
filters and develop an improved data-driven sparse representation method. Assume that only 
s{s < p^) filters including in the filters group H are suitable for the sparse representation 
of the noisy image. Without loss of generality, we group filters H into two classes; H\ - 


hi,h2, ■ ■ ■ ,hs 


and H 2 - 




^), which correspond to the 


bases of the signal subspace and the noise subspace respectively, and hence only Hi is suitable 
for the sparse modeling of the noisy image. Then similarly to (I2.21 i. a variational model for the 
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sparse representation of the image is proposed as follows 


min||v-lV(//i)g||2 + do||v||o, 

v,Hi 

, 1 (3.8) 

s.t. H\Hi = —I. 

P 

The proposed model (I3.81 l can only demonstrate that the image can be sparsely represented by the 
biters Hi. However, it cannot guarantee that Hi is generated by the bases of the signal subspace. 
Due to the fact that Hi captures the strong features of the image, the following constraint 

max\\W(Hi)g\\l (3.9) 

^1 

should be satisbed for the biters Hi. It is noticed that 

\\W(Hi)g\\l + \\W(H2)g\\l = 
g^W(Hi/W(Hi)g + g^W{H2VW(H2)g 
- g^W(H)^W(H)g = g^g - const. 

Therefore, max \\W{Hi)g\'^ is equal to 

inin||lT(//2)g||i (3.10) 

Hi 

The model (13.101) demonstrates that H 2 corresponds to the projection onto the noise subspace, in 
other words. Hi corresponds to the projection onto the signal subspace. 

Combining the models (13.81 ) and (13.101 ) we further propose a variational model for adaptive 
biters learning as follows 


min ||v - W(//i)g ||2 + \\W{H 2 )g \'^2 + 4ll^llo> 

v,H\ Ml 

T 1 

S.t. H^H = -./. 

P 


(3.11) 


It is noteworthy that the second term in (13.111 ) plays the role of eliminating the biters from the 
noise subspace, and hence cannot be removed. More precisely, in the example shown in Figure 


[T] if we choose HI 



p-x30 


and H2 - 


hi,h 2 ,"- , his, hso, ■ ■ ■ , /i64 


e 


through direct computation we can observe that 


{||VHl-iy(//l)g||^+d>Hlllo} 
{||vh2 - W ( H 2 ) g \\2 + /1q||vh2|Io} 


where vhi and vhi represent the coefficients corresponding to HI and H2 respectively. The 
above result implies H2 that contains the last 15 biters contaminated by noise is superior to H\ 
in the problem 

min llvi - W(Hi)g\\l + t2||v||o s.t. Hi Hi = \l. Hi 6 K'’'x30_ 

v\,Hi 

1 













This is due to that \\hi^G \\2 < \\hj^G \\2 for hj and hj that belong to the noise subspace and 

signal subspace respectively, and hence ||0 - rnay be smaller than ||vj - hj^G\\\ + /l^||v^||o- 

Therefore, the extra term \\W{H 2 )g\'^ is necessary. 

In what follows, we further reformulate the minimization problem (13.1 11 1 in the matrix form. 
Define 


, Di (3.12) 

D2^p iCi,---, v)e 

D = (D,,D2) 6 
Then problem (13.1 11 1 can be rewritten as 

mm\\V-D\G\\l + \\DlG\\l+X^\\V\\Q, s.t. =/. (3.13) 

The established model can be solved by the alternating minimization scheme between V and 
D. Specifically, given the current estimate (Vj-, D^-), the next iteration updates its value via the 
following scheme; 


Vi+i = arg min HV - (Di)[G||^ + d^||T||o, 

ygRsxW 

< Di+i = arg min \\Vk+\ - D\G\\\ + ||D[G||^, (3.14) 

s.t. D^D^I. 

The first sub-problem of (13.14b has a closed solution as follows: 


where ‘Hi 


Vk^i = -K2((Di)[G) 

is the hard thresholding operator defined by 




if I Gy I >T, 
Otherwise. 


The objective function of the second sub-problem of (13.14b can be reformulated as 

\\Vk^i-DlG\\l + \\DlG\\l 

= Tr((y,+i - D[G)^(y,+i - D[G)) + Tr(G^D2DjG) 


= Tr(Vl,Vk^i) + Tr(G^DD^G) - 2Tr(D[Gy,';i) 


= Tr(y,';iy,+i) + Tr(G^G)- 


■2Tt{D^GVI,). 


(3.15) 


(3.16) 


(3.17) 


'1 

Note that the third equality of (13.17b is from the fact that DD^ - /, which is derived directly 
from D - I. Therefore, the optimal value (Di)i:+i is the solution of the following problem 


maxTr(D[GyJ.,i) s.t. D[Di =/. (3.18) 

^^1 
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Suppose that the SVD of e is Uk+]t-k+\^l^ p where Ut+i e £jt+i e and 

Xk+\ G Then based on Theorem 4 in ll2^ we know that the minimization problem (13.181 1 
has a closed solution 


= Uk^iXl^,. (3.19) 

The value of {D 2 )k+i should satisfy that = I. In other words, it should be the orthocom¬ 

plement of iDi)k+i ■ Since it represents the useless filters and is independent with the update of V, 
the corresponding computation can be neglected here. Following the above analysis, we obtain 
the improved data-driven sparse representation method, which is summarized in Algorithm 1. 
After the learned biters set (Z)i)a: is obtained, the denoised image can be given by 

gK = (W((Di)k)f(9i2(W((Di)k)8)). (3.20) 

where A is the threshold determined by the noise level. 


Algorithm 1 Construction of the data-driven biters for sparse representation 
Input: the noisy g. 

Output: data-driven biters set Di debned by p {h\,h 2 , • ■ ■ , bjj 6 

Initialization: set the number of biters s, and initial biter matrix Do; construct the patch 
matrix G debned by (13.121) . 


Iteration: For k -0,1, - ■ ■ ,K- 1 


(i) update V: 

Vk^i = 9{A(Di)lG)- 


(ii) update Di: 
compute the SVD decomposition of GV, 


k+l 




Uk+itk+iXl^{, and update Di as (Di)k+i = 


3.2. Further analysis of the proposed model 

In this section, we further analyze the convergence of the proposed algorithm. In recent 
years, the convergence of the alternating proximal minimization algorithm for non-convex min¬ 
imization problems has been widely investigated 02711281] . One of the widely used tool is the 
Kurdyka-Lojasiewicz (K-L) property, see |26, 27, 29t] for more details. In literature lE^I . the au¬ 
thors used the results presented in illn to further investigate the convergence of the data-driven 
tight frame construction scheme introduced in section |2] The convergence of the proposed algo¬ 
rithm can be proved very similarly. 

Denote the convex set 


- {''i.i I ^ij - 0 for any i > s}, 
Qb = {D I D^D = /) 


for any V e Then the proposed model (13.131 ) can be reformulated as 

min ||y-D^G||i H-d^liyilo, s.t. D^D^I. (3.21) 

D,VeCly 
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Define 


/(V) = i^iiyii^ + (y)^ g(y^£,) 

(3 22) 

^\\V-D^G\\l,h{D)^ia,{D) 

where tn denotes the indicative function of the set Q, i.e., in(A) = 0 if A e D, and +cx) otherwise. 
Then, the minimization problem (13.211 1 can be rewritten as 


min Uy, D) = f(V) + h(D) + Q{V, D). (3.23) 

D,V 

Similarly to the proof in literature 12^ . we can easily derive the following convergence result. 
Due to the proof requires only minor changes to that presented in 1^ . we omit it here due to 
limited space. 

Theorem 3.1. (The convergence of Algorithm 1 ) The sequence [D^, V^] generated by Algorithm 
1 has at least one limit point. Let (D*, V*) be any limit point of the sequence {D^, V^]. Denote 
y* — (y*; 0) 6 Then (D*, V*) is a stationary point of ( 13.231 ). 


However, the above conclusion only illustrates the sub-sequence convergence property of 
the proposed algorithm. The sequence convergence property can be guaranteed by coupling this 
alternating minimization method with a proximal term. See 27, 28] for more details. In this 
setting, the proposed iteration scheme can be modified as 


f y,t+i = argmini:(y,D^) H-/li:||y-yt||2, 

J v _ 13 241 

I Dk+i^tixgmmJL(Vk+i,D)+pk\\D-Dk\\], 

where Ak,pk 6 (ci,C 2 ) and ci,C 2 are two positive constants. Consequently, the iteration scheme 
of Algorithm 1 can be modified as 




i/Vi+¥ 


(Di)lG+XtVi\ 

1+A j ’ 


iDi)k+\ - 


(3.25) 


where Uk+\ and Xk+i are given by the SVD of + pk(Di)k = Uk+i^k+iXl^^. Then similarly 

to the results shown in ll26ll . we can easily deduce the following conclusion. 


Theorem 3.2. The sequence {D'‘,V^} generated by ( 13.241 ) converges to a stationary point of 

(1123. 


In fact, the lack of sequence convergence is not crucial for the application of image denoising 
due to the reason that the results we are seeking for are not the frame coefficients but the image 
synthesized from the coefficients. Numerical experiments in also demonstrate that the al¬ 
gorithms with and without the proximal term shown in (I3.241 i have almost the same denoising 
performance in terms of PSNR values. The conclusion also exists for the proposed algorithm. 
Therefore, we choose Ak — Pk-^ tn the following experiments. 


10 


















Figure 2: Original images, (a) Barbara, (b) Hill, (c) Lena, (d) Man, (e) Fingerprint, (f) Boat. 


4. Numerical experiments 


In this section, we compare the proposed algorithm (shown as Algorithm 1) with the origi¬ 
nal data-driven tight frame construction scheme lE^ and the recently proposed state-of-the-art 
models 112311241] in the held of AWGN denoising. Both the quality of the recovery images and 
the computational costs of these algorithms are compared. 

The codes of Algorithm 1 and the original data-driven tight frame method Q are written en¬ 
tirely in Matlab. All the numerical examples are implemented under Windows XP and MATLAB 
2009 running on a laptop with an Intel Core i5 CPU (2.8 GHz) and 8 GB Memory. In the follow¬ 
ing experiments, six standard nature images with size of 512 x 512 (see Figure|2|, which consist 
of complex components in different scales and with different patterns, are used for our test. 


4.1. The analysis of the proposed algorithm 

First of all, we simply analyze the computational costs of both the original data-driven tight 


frame construction scheme Il25h and the proposed algorithm (shown as Algorithm 1). It is ob¬ 
served that each iteration of both algorithms includes two stages; hard thresholding and tight 
frame update. For the original algorithm in lE^ . the size of the matrix D^G used for hard thresh¬ 
olding operator is p^ x N, and the tight frame update is done via the S VD of the matrix GV^ with 
the size of p^ x p^. For Algorithm 1, the size of the corresponding matrix used for hard thresh¬ 
olding operator is s x A, and the S VD used for the tight frame update is related to the matrix with 
the size of sxp^. Therefore, the whole computational cost can be reduced obviously by choosing 


* http://www.math.nus.edu.sg/ matjh/download/data 
_driven_tight_frames/data_driven_tight_frame_verl.0.zip 
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a small s. Besides, the computational burden of the analysis operator and the synthesis operator 


generated by Di e 


which corresponds to s times of convolution operator, is also small 


than those produced by D 6 . Hence, the computational cost of generating the denoised 

image (see the formula (I3.201 i) is also smaller than that of the original data-driven tight frame 
method. 

In what follows, the initialization of the data-driven biters set Di and the selection of the 
parameter s are further discussed. The initialization of tight frames has been investigated in 
literature 11250 . and one suitable selection is to use DCT as the initial guess. Denote D = 
{di,d 2 , ■ ■ ■ ,dp 2 } as the tight frame generated hy p x p DCT. We can simply generate the ini¬ 
tial Di by randomly choosing s elements from D. On the other hand, inspired by the discussion 


in section [30] (refer to (iTTl i and Figure [T]i, we can use the signal component S = 2 

i=l 

obtain the initial D\. Specibcally, we choose s elements from S that satisfy ||e ,||2 > e, where e, 
denote the elements of S, and e is a small constant such as 10“^ to guarantee the chosen elements 
are not too small and may represent the main feature of the image. The matrix S can be generated 
by one or two iteration steps (In our experiments we choose two iteration steps) of the original 
data-driven tight frame construction scheme. 

It is noted that one important parameter of the proposed algorithm is the biters number s. In 
the next, three images, Barbara, Lena and Boat, are used for testing the performance of Algorithm 
1 with different values of s and initialization of biters set D\. The size p is chosen to be 8, which 
is a suitable selection by considering both the recovery quality and implementation time. For 
the thresholding parameters A and A, we use the default setting in the recent published code of 
literature i.e., bx d = 3.4cr and A - 2.1a-, which is the suitable selection through many 
trials. The maximum iteration number is set to 25. The performance is quantitatively measured 
by means of PSNR, which is expressed as 


PSNR(M,O--201g| ""J'' | (4.26) 

where u and u* denote the original and restored images, respectively, and N is the total number 
of pixels in the image u. 

Table[T]lists the PSNR values and CPU time for various selection of s values and initialization 
of Di. Here s - 10, • • • ,50 represent that the corresponding initialization of Di is obtained by 
the signal component S, and DCT, s - 10,20 demonstrate that the initialization is given by the 
DCT. It is observed that i = 20 or 30 is a suitable selection for Algorithm 1, which means that 
s can be chosen as /7^/3 or /7^/2 approximately. Besides, the performance of the algorithm with 
the initialization of D\ from S overall outperforms that using the DCT as the initialization. In the 
following experiments, we use the better initialization strategy for D\. 


4.2. Denoisingperformance evaluation on additive white Gaussian noise 

In this section, we report the experimental results, comparing the proposed algorithm with the 
original data-driven tight frame construction scheme ll^ . on the denoising of additive Gaussian 
noise. Six natural images with size of512x512are used for our test. Considering both the 
recovery quality and computational time, we set the maximum iteration number to 25 for both 
algorithms through all experiments. In this experiment, we consider the biters with size of 8 x 8 
and 16x16 respectively. The selection of the thresholding parameters A and A is the same as 
that mentioned above. Tablel^lists the PSNR values and CPU time of different algorithms. Here 
the original data-driven tight frame construction scheme ll25n is abbreviated as “DDTF”, and 
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Table 1: The comparison of the performance of Algorithm 1 with different s and initialization 


Image 

noise 

index 

5 = 10 

5 = 20 

II 

o 

o 

II 

i = 50 

DCT , i = 20 

DCT , 5 = 30 


0 - = 30 

PSNR 

26.16 

27.84 

28.57 

28.56 

28.53 

27.83 

28.54 


Time 

1.10 

1.62 

2.21 

2.79 

3.39 

1.59 

2.15 

Barbara 

II 

O 

PSNR 

25.21 

26.54 

27.15 

27.12 

27.07 

26.69 

27.08 

Time 

1.06 

1.68 

2.16 

2.73 

3.43 

1.57 

2.15 


II 

O 

PSNR 

24.94 

25.64 

26.00 

25.89 

25.88 

25.54 

25.80 


Time 

1.09 

1.63 

2.19 

2.73 

3.26 

1.73 

2.11 


0 - = 30 

PSNR 

30.41 

30.58 

30.59 

30.51 

30.44 

30.52 

30.51 


Time 

1.09 

1.68 

2.17 

2.72 

3.29 

1.58 

2.14 

Lena 

II 

4^ 

O 

PSNR 

29.18 

29.23 

29.18 

29.12 

29.02 

29.08 

29.14 

Time 

1.07 

1.64 

2.19 

2.72 

3.20 

1.56 

2.13 


II 

O 

PSNR 

28.15 

28.16 

28.07 

27.99 

27.86 

28.06 

28.04 


Time 

1.10 

1.63 

2.20 

2.74 

3.29 

1.57 

2.12 


O 

II 

b 

PSNR 

28.01 

28.48 

28.53 

28.52 

28.47 

28.51 

28.48 


Time 

1.07 

1.63 

2.18 

2.72 

3.31 

1.57 

2.11 

Boat 

II 

4^ 

O 

PSNR 

26.98 

27.21 

27.21 

27.20 

27.15 

27.15 

27.14 

Time 

1.06 

1.65 

2.19 

2.70 

3.26 

1.72 

2.12 


II 

O 

PSNR 

26.11 

26.24 

26.17 

26.15 

26.08 

26.14 

26.16 


Time 

1.08 

1.64 

2.15 

2.73 

3.27 

1.56 

2.13 


“DDTF; 8” represents the DDTF method with p - S. Similarly, “Algl(20);8” denotes Algorithm 
1 with p = 8 and s = 20, where the number in the bracket represents the value of s, i.e., the 
number of chosen filters. Two values of s, which approximately equal to p^/3 or p^/2, are tested 
here. 

From the results in Table |2] we observe that the proposed algorithm overall outperforms the 
DDTF method in both the recovery quality and computational time, especially for the higher 
noise level. This is due to more filters learned from the SVD of the matrix GV^ are influenced 
by the several noise. As a result, the coefficients generated by these Alters may not be sparse, 
and removing these Alters can improve the recovery quality and reduce the computational cost 
meanwhile. In general, comparing the indexes in Table |2] we can easily draw the following 
conclusions. 

1) The DDTF method with p = 8 takes about 4.0 ~ 4.1s, Algorithm 1 with p = 8 and s = 20 
takes about 1.6 ~ 1.7s, and Algorithm 1 with p = 8 and s = 30 takes about 2.1 ~ 2.2s. The 
computational time of the corresponding algorithms increases more than ten times due to the fact 
that both the size of the support set of Alters and the number of Alters increase four times. 

2) The small s is suitable for images with less textures and higher noise level, and the im¬ 
provement of recovery quality becomes more obvious while noise level increases. For the case 
of p = 8, it is observed that Algorithm 1 with i = 20 obtains the best PSNRs for almost all the 
condition that cr > 40 (except the Barbara image which is rich in terms of textures). 

3) These algorithms with p = 16 outperform those with p = 8 for the images Barbara, Lena 
and Fingerprint, especially when the noise is large. Maybe it is the reason that these images have 
more complex texture regions. 

Figure [3 shows the denosing results of different algorithms, and the corresponding learned 
Alters are presented in Figure|4] As observed, part of these Alters generated by the DDTF method 
are contaminated by noise terribly due to the influence of high noise. However, these Alters are 
not included in the Alters generated by our method. An visual observation of the results for the 
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Figure 3: Visual comparison of denoising results, (a) Noisy image with (t = 40, (b) result by DDTF method with p = 8, 
(c) result by Alg.l with p = 8 and s = 30, (d) result by DDTF method with p = 16, (e) result by Alg.l with p = 16 and 
5 = 80, (f) result by Alg.l with p = 16 and 5 = 120. 


Lena image can also be obtained in Figure |5] 

In what follows, we compare Algorithm 1 with some of the state-of-the-art denoising meth¬ 
ods l23l proposed very recently. The codes of these algorithms are all written and imple¬ 
mented in Matlab, and hence the comparison is fair. The numerical experiments in demon¬ 
strate that the DDTF method runs much faster than the K-SVD method with comparable perfor¬ 
mance on denoising, thus so is the proposed algorithm. Recently, many patch-based denoising 
models which utilize the nonlocal similarity of image patches were proved to be superior to the 
K-SVD method, and hence the proposed algorithm in the denoising performance. However, the 
corresponding computational amount is much greater. Tabled lists the PSNR values and com¬ 
putational time of the three algorithms. Note that “LASSC” and “WNNM” represent the models 
in EH and EH respectively. It is observed that the implementation time of the compared 
methods is too longer than that of Algorithml, though the denoising performance is better. 

Finally, in order to further verify the stability of the proposed method, the experiments are 
conducted on twenty-four standard images randomly selected from a large image dataset which 
is constructed by the computer vision group of University of Granada. These test images with 
size of512x512 (see Figure 6) can be freely downloaded from the network^. Table |4] lists the 
PSNR values of different algorithms. Here “img. 1-img. 24” denotes the test images in Figure 6 
in sequence. Once again we observe that the proposed method can achieve the better PSNRs in 
most cases. Here we only consider the case of p = 8. The similar conclusion can be obtained by 


^http://see.xidian.edu.cn/faculty/wsdong/Data/LASSC_Denoising.rar 
^http://www4.comp.polyu.edu.hk/ cslzhang/code/WNNM_code.zip 
'*http://decsai.ugr.es/cvg/dbimagenes/index.php 
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Table 2: The comparison of the performance of Algorithm 1 and the original algorithm {2S\ 


image Noise 

DDTF[2^;8 

Algl(20);8 

Algl(30);8 

DDTF[2^;16 

Algl(80);16 

Algl(120);16 

20 

30.60 

29.34 

30.52 

31.00 

29.91 

30.85 

30 

28.45 

27.84 

28.57 

28.94 

28.58 

29.03 

Barbara 

26.95 

26.54 

27.15 

27.47 

27.41 

27.61 

25.75 

25.64 

26.00 

26.30 

26.42 

26.52 

60 

24.70 

24.84 

24.97 

25.39 

25.61 

25.47 

70 

23.82 

24.12 

24.06 

24.62 

24.93 

24.81 

Ave. Time (s) 

4.05 

1.64 

2.18 

54.75 

18.41 

26.49 

20 

30.20 

30.13 

30.23 

30.16 

30.07 

30.15 

30 

28.54 

28.65 

28.66 

28.52 

28.56 

28.60 

H.,, « 

27.33 

27.49 

27.53 

27.40 

27.47 

27.55 

26.43 

26.72 

26.70 

26.56 

26.68 

26.74 

60 

25.68 

26.05 

25.99 

25.82 

26.09 

26.09 

70 

25.02 

25.49 

25.44 

25.22 

25.51 

25.57 

Ave. Time (s) 

4.05 

1.63 

2.19 

55.08 

18.35 

26.48 

20 

32.31 

32.41 

32.46 

32.38 

32.44 

32.46 

30 

30.31 

30.58 

30.59 

30.47 

30.63 

30.64 

T 40 

50 

28.83 

29.23 

29.18 

29.09 

29.31 

29.33 

27.64 

28.16 

28.07 

28.02 

28.35 

28.35 

60 

26.67 

27.22 

27.14 

27.10 

27.47 

27.45 

70 

25.84 

26.47 

26.38 

26.32 

26.80 

26.76 

Ave. Time (s) 

4.05 

1.64 

2.18 

55.37 

18.40 

26.47 

20 

30.02 

29.81 

29.98 

29.79 

29.62 

29.76 

30 

28.20 

28.27 

28.31 

27.98 

27.98 

28.07 

40 

Man 

26.97 

27.15 

27.15 

26.84 

26.86 

26.90 

26.05 

26.28 

26.27 

25.96 

26.10 

26.16 

60 

25.29 

25.62 

25.55 

25.26 

25.47 

25.46 

70 

24.64 

25.05 

24.99 

24.71 

24.96 

24.98 

Ave. Time (s) 

4.08 

1.64 

2.18 

55.24 

18.42 

26.45 

20 

28.36 

28.42 

28.44 

28.36 

28.37 

28.41 

30 

26.17 

26.29 

26.28 

26.26 

26.32 

26.33 

Fingerprint 

24.66 

24.82 

24.81 

24.86 

24.90 

24.92 

23.47 

23.65 

23.64 

23.82 

23.88 

23.95 

60 

22.35 

22.63 

22.63 

22.97 

23.04 

23.11 

70 

21.56 

21.82 

21.77 

22.28 

22.39 

22.43 

Ave. Time (s) 

4.10 

1.66 

2.19 

55.47 

18.69 

26.78 

20 

30.35 

30.19 

30.40 

30.24 

30.13 

30.25 

30 

28.39 

28.48 

28.53 

28.33 

28.43 

28.44 

5? 

27.00 

27.21 

27.21 

27.01 

27.13 

27.05 

25.95 

26.24 

26.17 

26.00 

26.16 

26.20 

60 

25.10 

25.37 

25.34 

25.19 

25.38 

25.44 

70 

24.34 

24.70 

24.65 

24.52 

24.69 

24.80 

Ave. Time (s) 

4.03 

1.65 

2.17 

55.35 

18.55 

26.44 
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Figure 4: The data-driven tight frame filters, (a) result by DDTF method with p = S, (b) result by Alg. 1 with p = 8 and 
V = 20, (c) result by Alg.l with p = 8 and i = 30, (d) result by DDTF method with p = 16, (e) result by Alg.l with 
p = 16 and i = 80, (f) result by Alg.l with p = 16 and i = 120. 


Figure 5: Visual comparison of denoising results, (a) Noisy image with cr = 30, (b) result by DDTF method with p = 8, 
(c) result by Alg.l with p = 8 and i = 20, (d) result by Alg.l with p = 8 and i = 30, (d) result by DDTF method with 
p = 16, (f) result by Alg.l with p = 16 and i = 80. 
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Table 3: The comparison of the performance of different algorithms: the given values are PSNR (dB)/CPU time(second) 


Image 

Noise ■ 

LASSC [2^ 

WNNM [24] 

Algorithm 1(30);8 

PSNR 

Time 

PSNR 

Time 

PSNR 

Time 


o- = 20 

30.57 

97.56 

30.80 

788.45 

30.23 

2.19 

Hill 

a- = 30 

28.91 

135.11 

29.20 

1228.86 

28.66 

2.19 


O 

II 

b 

27.91 

271.70 

28.05 

1423.83 

27.53 

2.19 


O- = 20 

28.97 

97.56 

29.05 

781.6 

28.44 

2.22 

Fingerprint 

a- = 30 

26.92 

136.20 

27.11 

1357.58 

26.28 

2.18 


O 

II 

b 

25.63 

269.47 

25.71 

1415.43 

24.81 

2.22 


O- = 20 

30.53 

107.10 

30.73 

781.62 

29.98 

2.22 

Man 

O 

II 

b 

28.70 

152.41 

29.00 

1534.87 

28.31 

2.19 


o 

II 

b 

27.67 

310.80 

27.82 

1467.76 

27.15 

2.16 


a- = 20 

30.70 

91.88 

30.96 

754.44 

30.40 

2.23 

Boat 

o- = 30 

28.83 

135.16 

29.16 

1221.57 

28.53 

2.18 


O 

II 

b 

27.60 

263.17 

27.85 

1226.50 

27.21 

2.19 


considering the case of p - 16. 


5. Conclusion 


In this paper, inspired by the fact that part of the data-driven tight frame filters learned by 
the recently proposed method ll25n are influenced by the noise severely, we propose an improved 
data-driven Alters learning method. In our method, we divide the matrix used for generating the 
tight frame Alters into two part, cotTesponding to the so called signal subspace and noise sub¬ 
space. Then only the bases spanning the signal subspace are used for constructing the learned 
Alters. This means that the Alters contaminated by the noise severely are excluded for the sparsity 
representation of images. Numerical experiments demonstrate that our method overall outper¬ 
forms the original data-driven tight frame construction scheme. 
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Table 4: The comparison of the performance of Algorithm 1 and the original algorithm flSl 


Image 

img. 1 

img. 2 

img. 3 

(T 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[2a;8 

29.77 

28.29 

27.02 

26.01 

31.65 

30.18 

28.95 

27.90 

28.09 

26.75 

25.71 

24.76 

Algl(20);8 

30.00 

28.62 

27.42 

26.39 

32.07 

30.70 

29.59 

28.72 

28.19 

26.94 

25.95 

25.02 

Algl(30);8 

29.98 

28.50 

27.35 

26.37 

32.02 

30.54 

29.49 

28.59 

28.22 

26.88 

25.90 

24.92 

Image 

img. 4 

img. 5 

img. 6 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[25];8 

25.10 

23.67 

22.62 

21.78 

27.21 

25.99 

24.95 

24.01 

26.20 

24.99 

24.14 

23.46 

Algl(20);8 

24.84 

23.58 

22.63 

21.84 

27.19 

26.06 

25.09 

24.18 

25.42 

24.63 

23.99 

23.44 

Algl(30);8 

25.08 

23.70 

22.69 

21.86 

27.25 

26.08 

25.08 

24.17 

25.88 

24.87 

24.13 

23.50 

Image 

img. 7 

img. 8 

img. 9 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[25];8 

29.33 

28.13 

27.11 

26.22 

31.07 

29.56 

28.54 

27.55 

30.89 

29.49 

28.29 

27.17 

Algl(20);8 

29.50 

28.41 

27.44 

26.64 

31.42 

29.94 

29.05 

28.21 

31.12 

29.88 

28.78 

27.78 

Algl(30);8 

29.48 

28.36 

27.42 

26.59 

31.38 

29.88 

28.99 

28.09 

31.13 

29.81 

28.75 

27.70 

Image 

img. 10 

img. 11 

img. 12 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[25];8 

26.20 

25.03 

24.13 

23.48 

26.31 

25.20 

24.32 

23.59 

28.49 

27.27 

26.20 

25.29 

Algl(20);8 

25.95 

24.95 

24.17 

23.56 

26.27 

25.23 

24.48 

23.76 

28.54 

27.45 

26.43 

25.62 

Algl(30);8 

26.15 

25.03 

24.22 

23.59 

26.32 

25.24 

24.46 

23.74 

28.58 

27.44 

26.45 

25.56 

Image 

img. 13 

img. 14 

img. 15 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTFr251;8 

30.17 

29.11 

28.11 

27.28 

28.09 

26.64 

25.59 

24.76 

27.84 

26.45 

25.34 

24.45 

Algl(20);8 

30.28 

29.38 

28.56 

27.91 

28.14 

26.80 

25.78 

25.06 

27.40 

26.32 

25.39 

24.59 

Algl(30);8 

30.32 

29.37 

28.52 

27.83 

28.19 

26.80 

25.77 

24.98 

27.75 

26.53 

25.48 

24.64 

Image 

img. 16 

img. 17 

img. 18 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[25];8 

27.84 

26.59 

25.47 

24.62 

25.19 

24.12 

23.35 

22.75 

24.23 

22.92 

22.06 

21.37 

Algl(20);8 

27.91 

26.76 

25.74 

24.86 

24.99 

24.07 

23.37 

22.78 

23.61 

22.66 

21.95 

21.34 

Algl(30);8 

27.95 

26.70 

25.69 

24.84 

25.12 

24.13 

23.36 

22.82 

24.01 

22.86 

22.06 

21.42 

Image 

img. 19 

img. 20 

img. 21 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTFr251;8 

23.24 

22.09 

25.47 

21.30 

30.02 

28.92 

28.02 

27.19 

26.92 

26.04 

25.37 

24.85 

Algl(20);8 

22.50 

21.72 

25.74 

21.11 

30.18 

29.25 

28.48 

27.80 

26.91 

26.07 

25.50 

25.07 

Algl(30);8 

22.84 

21.89 

25.69 

21.21 

30.19 

29.23 

28.42 

27.71 

26.97 

26.10 

25.52 

25.07 

Image 

img. 22 

img. 23 

img. 24 

cr 

30 

40 

50 

60 

30 

40 

50 

60 

30 

40 

50 

60 

DDTF[2a;8 

29.38 

27.99 

26.80 

25.93 

26.53 

25.23 

24.31 

23.68 

26.19 

24.98 

24.03 

23.27 

Algl(20);8 

29.22 

28.07 

27.02 

26.23 

26.28 

25.21 

24.38 

23.84 

26.12 

25.03 

24.15 

23.44 

Algl(30);8 

29.35 

28.14 

27.02 

26.24 

26.50 

25.27 

24.42 

23.76 

26.22 
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